Sorting and Order Statistics
نویسنده
چکیده
We will analyze the number of comparisons needed to solve the problem of sorting n distinct items into ascending order, and the related problem of selecting the kth smallest item from an unsorted list of n items. Section 1 examines mergesort, an algorithm with excellent worst case performance, and Section 2 examines quicksort, an algorithm with excellent average case performance. Section 3 contains some decision tree lower bounds on the number of comparisons used by any comparison-based sorting algorithm in the worst and average cases. Section 4 contains two linear-time selection algorithms, one average case and one worst case, and a lower bound based on the adversary method.
منابع مشابه
2 4 Ju n 20 07 On probabilities for separating sets of order statistics ∗
Consider a set of order statistics that arise from sorting samples from two different populations, each with their own, possibly different distribution function. The probability that these order statistics fall in disjoint, ordered intervals, and that of the smallest statistics, a certain number come from the first populations, are given in terms of the two distribution functions. The result is...
متن کاملA spike sorting framework using nonparametric detection and incremental clustering
We introduce a statistical computing framework to address two important issues in spike sorting: flexible spike shape modeling and realtime spike clustering. In this framework, spikes are detected based on a nonparametric shape distribution; detected spikes are further grouped by an incremental clustering algorithm involving the second-order statistics–covariance matrix. We performed experiment...
متن کاملطبقهبندی پتانسیلهای عمل نرونی با استفاده از شبکههای عصبی شعاعی
Background: Studying the behavior of a society of neurons, extracting the communication mechanisms of brain with other tissues, finding treatment for some nervous system diseases and designing neuroprosthetic devices, require an algorithm to sort neuralspikes automatically. However, sorting neural spikes is a challenging task because of the low signal to noise ratio (SNR) of the spikes. The mai...
متن کاملLanguage Identification from Text Using N-gram Based Cumulative Frequency Addition
This paper describes the preliminary results of an efficient language classifier using an ad-hoc Cumulative Frequency Addition of N-grams. The new classification technique is simpler than the conventional Naïve Bayesian classification method, but it performs similarly in speed overall and better in accuracy on short input strings. The classifier is also 5-10 times faster than N-gram based rank-...
متن کاملThe Virtual Corpus Approach to Deriving Ngram Statistics from Large Scale Corpora
This paper reports our implementation of the Virtual Corpus approach to deriving ngram statistics for ngrams of any length from large-scale corpora based on the suffix array data structure. In order to enable the VC to accommodate corpora with a vocabulary of different size, we first convert corpus tokens into integer codes. To accelerate the processing, we employ a bucket-radixsort for sorting...
متن کامل